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ABSTRACT 

We propose an estimator for the trispectrum of a scalar random field on a sphere, discuss its geometrical 
and statistical properties, and outline its implementation. By estimating the trispectrum of the 4 year 
COBE-XyMR data (in HEALPix pixelization) we find new evidence of a non-Gaussian signal associated 
with a known systematic effect. We find that by removing data from the sky maps for those periods of 
time perturbed by this effect, the amplitudes of the trispectrum coefficients become completely consistent 
with predictions for a Gaussian sky. These results reinforce the importance of statistical methods based 
in harmonic space for quantifying non-Gaussianity. 

Subject headings: cosmic microwave background — cosmology: observations 



1. INTRODUCTION 

The Cosmic Microwave Background (CMB) is the clean- 
est window on the origin of structure in the very early uni- 
verse. A complete description of the statistical properties 
of cosmological fluctuations at a redshift z ~ 1000 affords 
us an essential insight into those processes which may have 
seeded the formation of galaxies. In a Gaussian theory of 
structure formation, such as the currently favored model 
of Inflation, the power spectrum contains all the possible 
information about the fluctuations. Any higher order mo- 
ment can subsequently be described in terms of it. How- 
ever, if the theory is non-Gaussian (as expected for struc- 
ture formation theories due to local effects from primor- 
dial phase transitions or more generally from non-linear 
processes), then there will be deviations from the simple 
Gaussian expressions for the higher order moments. Such 
behavior can serve as a powerful discriminator between 
different models of structure formation. 

Most analyses of CMB data to-date have focused on the 
angular power spectrum and its sensitivity to various pa- 
rameters of cosmological theories. Some work has been 
done on the estimation of the three point correlation func- 
tion and its analogue in spherical harmonic space, with 
intriguing results (Heavens 1998, Ferreira, Magueijo and 
Gorski 1998, Magueijo 2000, Banday, Zaroubi and Gorski 
2000). It is the purpose of this letter to propose a method 
for estimating the four point spectrum, the trispectrum, 
and to apply it to the COBE 4 year DMR data. This 
work complements the recent work of Hu (2001) where 
some of the properties of the angular trispectrum of the 
CMB are discussed. 

The outline of this letter is as follows. In Section |^ 
we construct a set of orthonormal estimators and describe 
their properties for a Gaussian random field. In Section |^ 
we apply the estimators to the COBE 4 year DMR data. 
We show that we detect the non- Gaussian signal found in 
Ferreira, Magueijo & Gorski (1998) and that it can be ex- 
plained by the arguments presented in Banday, Zaroubi & 



Gorski (2000), and in particular that this is a manifesta- 
tion of a known systematic effect. We therefore conclude 
that the COBE 4 year data is consistent with a Gaus- 
sian cosmological signal. In Section ^ we summarize our 
results. 

2. THE ESTIMATOR 

In this section we wish to construct a set of quantities 
for estimating the trispectrum of a random field on the 
sphere. The temperature anisotropy in a given direction 
on the celestial sphere, T(n), can be expanded in terms of 
spherical harmonic functions, Yfm(n): 



T(n) 



(1) 



For any theory of structure formation, the airn coefficients 
are a set of random variables; we shall restrict ourselves to 
theories which are statistically homogeneous and isotropic. 
In this case we can define the power spectrum Ce of the 
temperature anisotropics by {0,1^0*^1^1) = CiSw. 

We now seek to construct a set of tensors that are ge- 
ometrically independent, describe their statistical proper- 
tics for a Gaussian random field and then discuss the prac- 
tical issue of their implementation. Given a set of Oim we 
wish to find the index structure of the set of four point 
correlators such that (1) they are rotationally invariant 

(2) they form a complete basis (preferably orthonormal) 
of the whole space of admissible four-point correlators and 

(3) they satisfy the appropriate symmetries under inter- 
changes of m- and values. We shall restrict ourselves to 
the case in which = ^2 = ^3 = ^4 = Furthermore, 
throughout this section we keep £ fixed. We determine the 
tensor T such that 



\a^„ii a£„i2 Oims Olmi , 



(2) 



a=0 



where n=int(i?/3) (due to reflection, permutation and ro- 
tational symmetry). The T^^a values are then the compo- 
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nents of the trispectrum which we wish to estimate. The 
exphcit form of the T are 



'-t -'mi 



m2m3m4 



(3) 



2q 



E (-1 

M=-2a 

2a £ e ' 
—M m3 m4 



^ ^ 2a 



mi m2 M 
incquiv. permutations (4) 



where the matrices in parentheses are the Wigner 3- J sym- 
bols. The T"'^ are not orthogonal and satisfy 



mim2m3m4 mim2rM3m4 



4a + 1 
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^ £ 2a 
^ £ 2/3 



(where summation over the rui is assumed) which has rank 
n+1. The matrix Ci in (H) is a rectangular matrix (with a 
triangular sub-block) with n+1 columns and £+1 rows. It 
is constructed through a Gram-Schmidt procedure by sub- 
tracting for each a (starting from a = 0) the projection 
onto all a' < a and then normalizing the result. The a = 
(and hence a — 0) tensor is proportional to the Gaussian 
contribution. This can be easily seen given that for a = 
the Wigner 3J symbols are simply Kronecker S symbols 
in the corresponding indices. The remaining a > terms 
contain therefore no Gaussian signal and quantify the non- 
Gaussian part of the trispectrum. 

The T are orthonormal and can be used to construct an 
estimator for Ta from a realization of aim'. 



Tf:„ 



mi m2rM3m4 



a^mi aem2 O-lms O-errn 



(5) 



For a Gaussian random field we expect CT^[Tf:o] ^ o''^[ft,a] 
for a > 0, where (j'^[A] denotes the variance of the random 
variable A and Ti-q is simply the square of the minimum 
variance estimator of the Ce. One finds that (Ti-a) — 
and a^[fi.a] = 2iCf for all a > 0. 

To show that the Tg-a constitute a family of minimum 
variance estimators we construct a linear combination of 
the estimators: 



mim2m3m4 

and minimize the function 

2 



a=0 



(6) 



~ {%7iim2m3m4 ^^mi fl£m2 0£m3 dirrii ) 
V mim2m3m4 mim2m3m4 / 



(7) 



where summation over all nii is implied. The last term, 
a Lagrange multiplier, ensures that is normalized. We 
solve dc^aWca^y\ = dxallca^M, = to find a set of two 
equations 



where 



(24/ + liAtY^Cb + Acb = and c^ = 1 



Aab rj-a^l 'fb-^ 



(8) 



This is an eigenvector equation where, for a given eigen- 
vector c, the eigenvalue A will give the expected variance 
of the estimator. Of the n+1 eigenvalues, one is large and 
has an eigenvector proportional to T^-fl. The remaining 
eigenvalues have an amplitude A = 24 and each eigenvec- 
tor is a Tt^a for a > 0. 

Note that we can relate our parameterization to the one 
proposed in Hu (2001); If we reexpress equation (0) as 



\a^mi Of m2 a^m3 Of m4, 



(9) 



0=0 



where T^-a — ^i^T^ia then Q^^ as defined in equation 15 
of Hu (2001) can be written as 



g«(2a) = Tt,a + 2(4a + 1) 



P 



£ £ 2a 
£ £ 2j3 



(10) 



The numerical implementation of these estimators is 
more involved than for the bispectrum. If we omit the 
numerous symmetries, we have to consider for each £ a set 
of up to 8-^^ Wigner 3J symbols (compared to just one for 
the bispectrum). There are reasonably fast ways for con- 
structing the Wigner 3J symbols (Schulten and Gordon 
1976) but the number of operations per estimator scales 
as 0{£^). For repeated computations of the estimators 
(eg. in Monte Carlo studies) , this can partially be avoided 
by storing the precomputed estimators in a lookup table, 
with the amount of memory required scaling as 0{£^). 

Clearly, to be able to estimate the trispectrum on small 
angular scales, approximate methods must be developed 
to make the procedure computationally feasible. However, 
the ability to constrain non-Gaussianity on large angular 
scales is in any case more important physically for two rea- 
sons; the ratio of the non-Gaussian to the Gaussian signal 
will in general be higher for lower moments, and the signal 
to noise is better for low I. To understand these points, let 
us assume a source for non-Gaussianity which leads to ap- 
proximately scale invariant moments of the gravitational 
potential on arbitrary scales, i.e. ($(i?)^) is constant for 
any R, where 3>(i?) is the gravitational potential within a 
ball of radius R and (• • •) denotes the ensemble average. 
This might be expected from a primordial source with no 
preferred scale such as inflation (Komatsu & Spergel 2000) 
or from an active source where the only scale is set by the 
horizon today (Durrer et al 2000). Current observations of 
the CMB certainly favor such scale-invariant descriptions 
of the potential. One then expects the moment of order 
N of the agm to scale as £2(i-JV)_ q^j^jg signal will be com- 
peting against the fluctuations due to the disconnected (or 
Gaussian) part, which is proportional to A^!£~(^^+^)/^, the 
former therefore dominating for N > 2. Since the power 
spectrum for white noise has constant amplitude, the sig- 
nal to noise as a function of scale will have the same form 
as the scale invariant power spectrum itself, therefore be- 
ing larger for smaller /, ie. larger angular scales. 
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3. RESULTS 

As an application of the formalism described in Sec- 
tion H, we estimate the trispectrum of the coadded 53 
and 90 GHz COBE-DMR 4 year sky maps in HEALPix 
format (Gorski et al 1999). The resolution of the maps 
is -^gide ~ ^''^ 49152 pixels. We do not extend om' 
analysis beyond ^max — 20 since the signal to noise is 
poor for higher I. Hence the maximal number of inde- 
pendent non-Gaussian estimators for the trispectrum is 
int(^niax/3) — 6. We set the pixels in the extended Galac- 
tic cut (Banday et al 1997) to zero and subtract the resid- 
ual monopole and dipole of the resulting map. After con- 
volving the maps with spherical harmonics to extract a set 
of aim's for Z < 20 we then apply equation (|^). To vali- 
date our software, we have estimated the bispectrum of the 
COBE-T)MR 4 year sky data repixelized in the HEALPix 
format (for convenience denoted by EC) and reproduced 
the results of Ferreira, Magueijo & Gorski (1998), and 
in particular the strong non-Gaussian signal present at 
t — 16. When an equivalent map, from which that part 
of the DMR time stream contaminated by the 'eclipse ef- 
fect^' is removed (denoted nec), is subsequently analyzed 
we also reproduce the results of Banday, Zaroubi & Gorski 
2000, namely that the non-Gaussian signal is no longer 
detected. For our subsequent analysis we will present the 
trispectra of both the EC and NEC data. 

One of our primary concerns is to compare our re- 
sults with the assumption that the CMB sky measured by 
COBE-T)MR is Gaussian. To do so, we generate 10000 
full-sky maps at the same resolution using a scale in- 
variant power spectrum normalized to Qrms-PS = 18/iK 
(Gorski et al 1998). We convolve each map with the DMR 
beam and add uncorrelated pixel noise with rms ampli- 
tude cr„ — 1 5 . 95mK / \/Nobs , (where Nobs is the number 
of times a given pixel was observed); we then subject the 
synthetic map to the same procedure as the original data. 
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Fig. 1. — The six estimators of the normalized trispectrum 
applied to the EC data (circles) and the NEC data (crosses). 
95% of all simulated Gaussian skies lie within the solid lines. 
Although removing the 'eclipse' data changes the noise prop- 
erties, we find that the Gaussian confidence limits essentially 
remain unchanged. 



Figure 1 shows the trispectra of the DMR data together 
with Gaussian 95% confidence limits. Instead of the "raw" 
estimator (^) we prefer to use the normalized trispectrum, 

T^"^ = Ti-a/Cf for a > 1 (where Ci = 2FPT 1"^™!^)' 
thus effectively removing the dependence on the power 
spectrum. This prevents fluctuations in the power spec- 
trum from introducing spurious signals and from mask- 
ing real non-Gaussianities. Figure 1 shows that in this 
case, most values fall within the 95% confidence lines and 
demonstrate the scatter expected for a Gaussian random 
field. 

Of particular interest is the value of the normalized r*^^^ 
at £ = 16 in figure 1. One finds that 99.9% of the Gaus- 
sian models in the EC case have a smaller t'-^^ than the 
measured one. This is clearly a manifestation of the non- 
Gaussianity found in Ferreira, Magueijo & Gorski (1998) 
which is highly localized in £ space. However, if we es- 
timate r'^^ for the NEC we find that it falls comfortably 
within the 95% confidence limits. This leads us to be- 
lieve that this detection of non-Gaussianity results from 
the 'eclipse effect', consistent with the hypothesis of Ban- 
day, Zaroubi & Gorski (1999). 
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Fig. 2. — The distribution of the Gaussian models (his- 
togram) and the actual data value (dotted line) for the EC 
(top graphs) and NEC (bottom graphs) datasets. The left two 
graphs show r'^' which contains the main contribution to the 
non-Gaussian signal and the right graphs show the total 
over all six non-Gaussian estimators, r*-^' to r'®-*. 

Let us now construct a goodness of fit for our statis- 
tic. In Ferreira, Magueijo & Gorski (1998), a modified 
was constructed which took into account the non-Gaussian 
distribution of each method: as above, the distribution of 
each estimator for a Gaussian sky was constructed and 
used as an approximate likelihood function to evaluate 
the goodness of fit. One shortcoming of such a method 



^The 'eclipse effect' was an orbitally modulated signal which took place for approximately two months every year around the June solstice 
when the COBE spacecraft repeatedly flew through the Earth's shadow. 
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was that correlations between the estimates for different 
were discarded. To include them, we use the Gaussian 
ensemble of data sets to derive the expectation values <>g 
and the covariance matrix C for both the power spectrum, 

Ci, and all seven trispectrum estimators, rf^ to Tf \ We 
proceed to calculate the value for the estimator £ and 
the data set V, 



X 



(11) 

using as data sets the EC data and the NEC data. Finally 
we use another 10000 Gaussian realizations to estimate 
the expected distribution of the ^oi both the EC and 
the NEC data. 

For all normalized non-Gaussian trispectrum estimators 
(r^^) to T^^^) we find that 94% of the Gaussian models have 
a smaller than the EC data as can be seen in figure 2. As 
expected the main contribution to the for the EC data 
stems from r^'^) at ^ = 16; indeed, this is the only normal- 
ized trispectrum estimator which exhibits any significant 
non-Gaussianity, in this case at about 99.9%. If we use 
the NEC data, the detection vanishes. In this case, 60% of 
all Gaussian models have a lower when computed over 
all six trispectrum estimators (83% for t^^^ alone). Hence 
the NEC data is compatible with Gaussianity. 

4. DISCUSSION 

In this paper, we have derived an estimator for the 
trispectrum of a scalar random field on the sphere. Appli- 
cation of this estimator, normalized by the power spectrum 
(a procedure adopted in Ferreira, Magueijo & Gorski, 1998 
for the bispectrum, see also Komatsu et al 2002 for a de- 
tailed discussion), to the COBE-XyMR data provides evi- 
dence for non-Gaussianity at the 94% confidence level. As 
in the case of the bispectrum, the signal is mainly present 
in the £ — 16 multipole (and the t^^^ estimator here). 
However, when data is excluded to correct for the 'eclipse 



effect', the non-Gaussian behavior is removed, allowing us 
to conclude that the non-Gaussianity present in the un- 
corrected sky maps is not cosmological in origin. 

The detection of a signal that is so strongly localized 
in £ space provides convincing support to our contention 
that the trispectrum is an important and sensitive probe 
of non-Gaussianity in the frequency (scale) domain. It af- 
fords complementary information to the bispectrum since 
it is an even moment, and, despite the higher computa- 
tional effort required, has the obvious advantage in that it 
can probe all values of £, not just the even ones. 

Interestingly enough, from a theoretical perspective, 
there may be some possible sources of non-Gaussianity for 
which the trispectrum provides a far more sensitive test 
than the bispectrum. In many cases a given moment of 
the a£„jS can be expressed as the projection of a cosmo- 
logical field. If that field is vector-like in nature (as in the 
case of the Dopplcr effect or the Ostriker-Vishniac effect 
and its non- linear extensions), any odd moment may suffer 
from the Sunyaev-Kaiser cancellation, where the integral 
of a given wavenumber, k, over a smoothly varying projec- 
tion function with width a tends to suppress the moment 
by a factor of order l/{ak)'^ (Sunyaev 1978, Kaiser 1985, 
Scannapieco 2000). For even moments one can always con- 
struct a scalar component which will not be subject to this 
cancellation. Such a tool will be of great use in the anal- 
ysis of the data sets from the MAP and Planck Surveyor 
satellites. 
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Note added in proof - E. Komatsu investigates the trispectrum of the COBE DMR data in his Ph. D. thesis. His 
conclusions agree with ours, namely that the COBE data is consistent with Gaussian initial fluctuations. 



